SINGLE NUCLEOTIDE POLYMORPHISM IN THE FGF-3 GENE 
AND METHODS OF USE THEREOF 

By Yongjun Guo 

This application claims priority under 35 
U.S.C. §119 (e) to US Provisional Application No. 
60/455,689, filed March 17, 2003, the entire content 
which is incorporated by reference herein. 

FIELD OF THE INVENTION 

The presence invention relates to the discovery of a 
single nucleotide polymorphism (SNP) in the FGF-3 gene 
and methods of use thereof. 

BACKGROUND OF THE INVENTION 

All publications, patent applications, patents, and 
other citations mentioned herein are incorporated by 
reference in their entirety . 

An individual's susceptibility to cancer is governed 
by the individual's genome and carcinogenic stimuli 
encountered in the environment. Although the 
carcinogenic potential of many compounds and other 
stimuli (e.g. ionizing radiation) have been determined, 
assessing the importance of limiting exposure to such 
compounds is complicated by the fact that not all 
individuals are equally susceptible to their deleterious 
effects. Accordingly, the genetic component of 
carcinogenic susceptibility confounds accurate prediction 
of cancer rates among individuals, even in defined 
environments . 

The FGF-3 gene encodes a secreted protein belonging 
to the fibroblast growth factor (FGF) family (1) . FGF-3 
was originally identified as a proto-oncogene (at first 



designated int-2) transcriptionally activated by proviral 
integration in virally induced mouse mammary tumors (2, 
3). In a colon carcinoma cell line, it has been shown 
that the FGF-3 gene is constitutively expressed in cells 
5 from tumorigenic clones but is completely silent in non- 

tumorigenic clones (4) . Amplification of FGF-3 gene has 
been observed in human esophageal (5-10), breast (11-22), 
ovarian (17, 18, 23-29), prostate (30-32), and head and 
neck cancer (5, 33-39). Earlier studies have revealed 

10 that 62% of primary esophageal cancers and 100% of these 

cancers which had metastasized to the lymph node had 
amplification of FGF-3 gene as demonstrated by Southern 
blot hybridization . 

In 1999, Djenabi et al. (40) published the 5' 

15 untranslated region and promoter sequence upstream of the 

human FGF-3 gene (GenBank Accession No. Y12377) . In 
2 000, Galdemard et al . identified a putative minimal 
promoter in the 5 ' -proximal region of FGF-3, about 6 kb 
upstream of the FGF-3 gene (41) . Additionally, it has 

2 0 been previously demonstrated that this minimal promoter 

of FGF-3 shares about 45% of sequence homology with the 
human ODC promoter region. 

SUMMARY OF THE INVENTION 

25 The present invention provides an isolated nucleic 

acid molecule having the sequence of SEQ ID NO : 1 or a 
complementary sequence thereof, a vector comprising the 
same, and a host cell comprising such a vector. 

Also provided is a method for detecting a single 

3 0 nucleotide polymorphism (SNP) in the FGF-3 gene in a 

mammal, which comprises a) isolating a nucleic acid 
sample from the mammal; and b) determining whether a 
cytosine or thymine is present at position 69 of SEQ ID 
NO: 1. In a particular embodiment, the method further 
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comprises amplifying a reference portion of the mammal's 
genome, which reference portion comprises the nucleotide 
residue located at position 69 of SEQ ID NO: 1. In a 
further embodiment, the method also comprises the step of 
5 annealing one or two oligonucleotide probes with a target 

portion of the mammal's genome prior to amplifying the 
reference portion, which target portion includes the 
nucleotide residue located at position 69 of SEQ ID NO: 
1. Fluorescent label (s) and optionally fluorescent 

10 quencher (s) may be attached to the one or both 

oligonucleotide probes. 

Also provided by the present invention is a kit for 
detecting a SNP in the FGF-3 gene in a mammal, which 
comprises a) at least one optionally detectably labeled 

15 oligonucleotide probe that anneals specifically with a 

target portion of the mammal's genome; and b) a pair of 
primers for amplifying a reference portion of the FGF-3 
gene, which reference portion includes nucleotide residue 
located at position 69 of SEQ ID NO : 1. 

2 0 The present invention further provides a method for 

assessing the relative susceptibility of a mammal to 
cancer, which comprises the detection of a SNP in the 
FGF-3 gene in a mammal. The presence of a cytosine at 
position 69 of SEQ ID NO : 1, being indicative of a 
25 greater susceptibility to the cancer than a mammal which 

does not comprise a cytosine at the position. Such 
cancers include, but are not limited to, esophageal, 
breast, ovarian, prostate, and head and neck cancer. 

3 0 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a nucleotide sequence showing the 
variant nucleotide at position 69 in SEQ ID NO: 1 (i.e., 
the C or T allelle) present in the upstream untranslated 
region of the FGF-3 gene, extending from nucleotide 
35 residues 4945 to 5508, relative to the published sequence 
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of FGF-3 gene upstream flanking region (GenBank Accession 
No. Y12377) . The SNP is located at position -6693 
relative to the ATG start codon for FGF-3. 

Figure 2 is a diagram depicting an embodiment of the 
allelic discrimination method of the invention. "R" 
(reporter) refers to a fluorescent label (e.g. a dye such 
as VIC, FAM, TET, JOE, or HEX) , and "Q" (quencher) refers 
to a fluorescence quencher (e.g. TAMRA) . 

Figure 3 is a graph depicting the distribution of 
human FGF-3 genotypes in different ethnic groups, i.e., 
African American, Caucasian, and Chinese. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is based on the discovery of a 
single nucleotide polymorphism (SNP) in the 5 ' -proximal 
region of the FGF-3 promoter, in which a cytosine is 
substituted for a thymine at position 69 in SEQ ID NO : 1 
(Figure 1) . The SNP occurs at position -6693 relative to 
the ATG codon of the FGF-3 gene. The polymorphism is 
believed to create a new transcription factor binding 
sequence in the FGF-3 promoter region. The presence of 
the C-allele in the FGF-3 gene in an individual is 
predictive of a predisposition to certain types of 
cancers, including without limitation, environmentally- 
induced esophageal squamous cell carcinoma. 

I. Definitions 

Various terms relating to the biological molecules 
of the present invention are used hereinabove and also 
throughout the specifications and claims. 

As used herein, a mammal "comprises a C-allele or T- 
allele of the FGF-3 gene" if the genome of the mammal 



comprises one or more copies of the C-allele or T-allele. 

A mammal is "homozygous" for the C-allele or T- 
allele of the FGF-3 gene if the genome of the mammal 
comprises two copies of the C-allele or T-allele. 

The "C-allele" of the FGF-3 gene refers to a 
mammalian FGF-3 gene having a cytosine residue at 
position 69 in SEQ ID NO: 1, or at the position -6693 bp 
upstream of the ATG codon of FGF-3 gene. 

The "T-allele" of the FGF-3 gene refers to a 
mammalian FGF-3 gene having a thymine residue at position 
69 in SEQ ID NO: 1, or at the position -6693 bp upstream 
of the ATG codon of FGF-3 gene. 

With reference to nucleic acids of the invention, 
the term "isolated nucleic acid" or "isolated 
polynucleotide" is sometimes used. This term, when 
applied to DNA, refers to a DNA molecule that is 
separated from sequences with which it is immediately 
contiguous (in the 5' and 3' directions) in the naturally 
occurring genome of the organism from which it 
originates. For example, the "isolated nucleic acid" may 
comprise a DNA or cDNA molecule inserted into a vector, 
such as a plasmid or virus vector, or integrated into the 
DNA of a prokaryote or eukaryote. 

With respect to RNA molecules of the invention, the 
term "isolated nucleic acid" primarily refers to an RNA 
molecule encoded by an isolated DNA molecule as defined 
above. Alternatively, the term may refer to an RNA 
molecule that has been sufficiently separated from RNA 
molecules with which it would be associated in its 
natural state (i.e., in cells or tissues), such that it 
exists in a "substantially pure" form (the term 
"substantially pure" is defined below) . 

The term "promoter region" refers to the 
transcriptional regulatory regions of a gene, which may 
be found at the 5' or 3 ' side of the coding region, or 
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within the coding region, or within introns . 

The term "vector" refers to a small carrier DNA 
molecule into which a DNA sequence can be inserted for 
introduction into a host cell where it will be 
5 replicated. An "expression vector" is a specialized 

vector that contains a gene with the necessary regulatory 
regions needed for expression in a host cell. 

The term "operably linked" means that the regulatory 
sequences necessary for expression of the coding sequence 

10 are placed in the DNA molecule in the appropriate 

positions relative to the coding sequence so as to effect 
expression of the coding sequence. This same definition 
is sometimes applied to the arrangement of coding 
sequences and transcription control elements (e.g. 

15 promoters, enhancers, and termination elements) in an 

expression vector. This definition is also sometimes 
applied to the arrangement of nucleic acid sequences of a 
first and a second nucleic acid molecule wherein a hybrid 
nucleic acid molecule is generated. 

2 0 The term "substantially pure" refers to a 

preparation comprising at least 50-60% by weight the 
compound of interest (e.g., nucleic acid, 
oligonucleotide, protein, etc.). More preferably, the 
preparation comprises at least 7 5% by weight, and most 

25 preferably 90-99% by weight, the compound of interest. 

Purity is measured by methods appropriate for the 
compound of interest (e.g. chromatographic methods, 
agarose or polyacrylamide gel electrophoresis, HPLC 
analysis, and the like) . 

30 The phrase "consisting essentially of" when 

referring to a particular nucleotide or amino acid means 
a sequence having the properties of a given SEQ ID NO: . 
For example, when used in reference to an amino acid 
sequence, the phrase includes the sequence per se and 

35 molecular modifications that would not affect the 
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functional and novel characteristics of the sequence. 

As used herein, the terms "reporter, " "reporter 
system", "reporter gene," or "reporter gene product" 
shall mean an operative genetic system in which a nucleic 
5 acid comprises a gene that encodes a product which when 

expressed produces a reporter signal that is readily 
measurable, e.g., by biological assay, immunoassay, 
radioimmunoassay, or by colorimetric , fluorogenic, 
chemi luminescent or other method. The nucleic acid may 

10 be either RNA or DNA, linear or circular, single or 

double stranded, antisense or sense polarity, and is 
operatively linked to the necessary control elements for 
the expression of the reporter gene product. The 
required control elements will vary according to the 

15 nature of the reporter system and whether the reporter 

gene is in the form of DNA or RNA, and may include, but 
not be limited to, such elements as promoters, enhancers, 
translational control sequences, poly A addition signals, 
transcriptional termination signals and the like. 

20 The terms "transform", "transfect", "transduce", 

shall refer to any method or means by which a nucleic 
acid is introduced into a cell or host organism and may 
be used interchangeably to convey the same meaning. Such 
methods include, but are not limited to, transf ection, 

25 electroporation, microinjection, PEG-fusion and the like. 

The introduced nucleic acid may or may not be integrated 
(covalently linked) into nucleic acid of the recipient 
cell or organism. In bacterial, yeast, plant and 
mammalian cells, for example, the introduced nucleic acid 

30 may be maintained as an episomal element or independent 

replicon such as a plasmid. Alternatively, the 
introduced nucleic acid may become integrated into the 
nucleic acid of the recipient cell or organism and be 
stably maintained in that cell or organism and further 

35 passed on to or inherited by progeny cells or organisms 
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of the recipient cell or organism. In other 
applications, the introduced nucleic acid may exist in 
the recipient cell or host organism only transiently. 

The term "oligonucleotide, " as used herein refers to 
primers and probes of the present invention, and is 
defined as a nucleic acid molecule comprised of two or 
more ribo- or deoxyribonucleotides , preferably more than 
three. The exact size of the oligonucleotide will depend 
on various factors and on the particular application and 
use of the oligonucleotide. 

The term "probe" as used herein refers to an 
oligonucleotide, polynucleotide or nucleic acid, either 
RNA or DNA, whether occurring naturally as in a purified 
restriction enzyme digest or produced synthetically, 
which is capable of annealing with or specifically 
hybridizing to a nucleic acid with sequences 
complementary to the probe. A probe may be either 
single-stranded or double-stranded. The exact length of 
the probe will depend upon many factors, including 
temperature, source of probe and the method used. For 
example, for diagnostic applications, depending on the 
complexity of the target sequence, the oligonucleotide 
probe typically contains 15-2 5 or more nucleotides, 
although it may contain fewer nucleotides. The probes 
herein are selected to be "substantially" complementary 
to different strands of a particular target nucleic acid 
sequence. This means that the probes must be 
sufficiently complementary so as to be able to 
"specifically hybridize" or anneal with their respective 
target strands. Therefore, the probe sequence need not 
reflect the exact complementary sequence of the target. 
For example, a non- complementary nucleotide fragment may 
be attached to the 5' or 3 ' end of the probe, with the 
remainder of the probe sequence being complementary to 
the target strand. Alternatively, non-complementary 
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bases or longer sequences can be interspersed into the 
probe, provided that the probe sequence has sufficient 
complementarity with the sequence of the target nucleic 
acid to anneal therewith specifically. 
5 The term "specifically hybridize" refers to the 

association between two single-stranded nucleic acid 
molecules of sufficiently complementary sequence to 
permit such hybridization under pre-determined conditions 
generally used in the art (sometimes termed 

10 "substantially complementary"). In particular, the term 

refers to hybridization of an oligonucleotide with a 
substantially complementary sequence contained within a 
single- stranded DNA or RNA molecule of the invention, to 
the substantial exclusion of hybridization of the 

15 oligonucleotide with single-stranded nucleic acids of 

non-complementary sequence . 

The term "primer" as used herein refers to an 
oligonucleotide, either RNA or DNA, either 

single-stranded or double-stranded, either derived from a 

2 0 biological system, generated by restriction enzyme 

digestion, or produced synthetically which, when placed 
in the proper environment, is able to functionally act as 
an initiator of template-dependent nucleic acid 
synthesis. When presented with an appropriate nucleic 
25 acid template, suitable nucleoside triphosphate 

precursors of nucleic acids, a polymerase enzyme, 
suitable cofactors and conditions such as a suitable 
temperature and pH, the primer may be extended at its 3' 
terminus by the addition of nucleotides by the action of 

3 0 a polymerase or similar activity to yield a primer 

extension product. The primer may vary in length 
depending on the particular conditions and requirement of 
the application. For example, in diagnostic 
applications, the oligonucleotide primer is typically 
3 5 15-25 or more nucleotides in length. The primer must be 
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of sufficient complementarity to the desired template to 
prime the synthesis of the desired extension product, 
that is, to be able to anneal with the desired template 
strand in a manner sufficient to provide the 3 ' hydroxyl 
moiety of the primer in appropriate juxtaposition for use 
in the initiation of synthesis by a polymerase or similar 
enzyme. It is not required that the primer sequence 
represent an exact complement of the desired template. 
For example, a non-complementary nucleotide sequence may 
be attached to the 5 ' end of an otherwise complementary 
primer. Alternatively, non-complementary bases may be 
interspersed within the oligonucleotide primer sequence, 
provided that the primer sequence has sufficient 
complementarity with the sequence of the desired template 
strand to functionally provide a template-primer complex 
for the synthesis of the extension product. 

"Complementary" refers to the broad concept of 
sequence complementarity between regions of two nucleic 
acid strands or between two regions of the same nucleic 
acid strand. It is known that an adenine residue of a 
first nucleic acid region is capable of forming specific 
hydrogen bonds ("base pairing") with a residue of a 
second nucleic acid region which is anti-parallel to the 
first region if the residue is thymine or uracil. 
Similarly, it is known that a cytosine residue of a first 
nucleic acid strand is capable of base pairing with a 
residue of a second nucleic acid strand which is anti- 
parallel to the first strand if the residue is guanine. 
A first region of a nucleic acid is complementary to a 
second region of the same or a different nucleic acid if, 
when the two regions are arranged in an anti-parallel 
fashion, at least one nucleotide residue of the first 
region is capable of base pairing with a residue of the 
second region. Preferably, the first region comprises a 
first portion and the second region comprises a second 
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portion, whereby, when the first and second portions are 
arranged in an anti-parallel fashion, at least about 50%, 
and preferably at least about 75%, at least about 90%, or 
at least about 95% of the nucleotide residues of the 
5 first portion are capable of base pairing with nucleotide 

residues in the second portion. More preferably, all 
nucleotide residues of the first portion are capable of 
base pairing with nucleotide residues in the second 
portion . 

10 A first region of an oligonucleotide "flanks" a 

second region of the oligonucleotide if the two regions 
are adjacent one another or if the two regions are 
separated by no more than about 1000 nucleotide residues, 
and preferably no more than about 100 nucleotide 

15 residues . 

A second set of primers is "nested" with respect to 
a first pair of primers if, after amplifying a nucleic 
acid using the first pair of primers, each of the second 
pair of primers anneals with the amplified nucleic acid, 

2 0 such that the amplified nucleic acid can be further 

amplified using the second pair of primers. 

As used herein, an "instructional material" includes 
a publication, a recording, a diagram, or any other 
medium of expression which can be used to communicate the 

25 usefulness of the composition of the invention for 

performing a method of the invention or for associating 
the presence of a C-allele of the FGF-3 gene in a mammal 
with carcinogenic susceptibility. The instructional 
material of the kit of the invention can, for example, be 

30 affixed to a container which contains a kit of the 

invention or be shipped together with a container which 
contains the kit. Alternatively, the instructional 
material can be shipped separately from the container 
with the intention that the instructional material and 

35 the kit be used cooperatively by the recipient. 
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II. Single Nucleotide Polymorphism (SNP) in the FGF-3 
Gene 

In accordance with the present application, a SNP in 
the FGF-3 gene has been identified which is predictive of 
an individual's risk for certain types of cancer, 
particularly esophageal cancer. 

Accordingly, the present invention provides a 
nucleic acid molecule comprising the sequence of SEQ ID 
NO: 1 and the complement thereof. 

Nucleic acid molecules of the present invention may 
be prepared by two general methods: (1) Synthesis from 
appropriate nucleotide triphosphates, or (2) Isolation 
from biological sources. Both methods utilize protocols 
well known in the art. 

The availability of nucleotide sequence information, 
such as a full length nucleic acid sequence having SEQ ID 
NO: 1, enables preparation of isolated nucleic acid 
molecules of the invention by oligonucleotide synthesis. 
Synthetic oligonucleotides may be prepared by the 
phosphoramidite method employed in the Applied Biosystems 
38A DNA Synthesizer or similar devices. The resultant 
construct may be purified according to methods known in 
the art, such as high performance liquid chromatography 
(HPLC) . Long, double- stranded polynucleotides, such as a 
DNA molecule of the present invention, must be 
synthesized in stages, due to the size limitations 
inherent in current oligonucleotide synthetic methods. 
Thus, for example, a 1.4 kb double- stranded molecule may 
be synthesized as several smaller segments of appropriate 
complementarity. Complementary segments thus produced 
may be annealed such that each segment possesses 
appropriate cohesive termini for attachment of an 
adjacent segment. Adjacent segments may be ligated by 
annealing cohesive termini in the presence of DNA ligase 
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to construct an entire 1.4 kb double- stranded molecule. 
A synthetic DNA molecule so constructed may then be 
cloned and amplified in an appropriate vector. 

Nucleic acid sequences of the present invention may 
also be isolated from appropriate biological sources 
using methods known in the art. 

Also contemplated with the scope of the present 
invention are vectors or plasmids containing the nucleic 
acid sequence of SEQ ID NO: 1, and host cells or animals 
containing such vectors or plasmids. Methods for 
constructing vectors or plasmids containing the nucleic 
acid sequence of SEQ ID NO : 1, and host cells or animals 
containing the same are within the ability of persons 
skilled in the art of molecular biology. 

III. Genotyping of FGF-3 Gene 

According to one aspect of the present invention, a 
method for determining whether a human is homozygous for 
the C-allele, heterozygous for the C- and T-alleles, or 
homozygous for the T-allele of the FGF-3 gene is 
provided. Substantially any method of detecting an 
allele of the FGF-3 gene, such as hybridization, 
amplification, restriction enzyme digestion, and 
sequencing methods, can be used. 

In one embodiment, an allelic discrimination method 
for identifying the FGF-3 genotype of a human was used 
(Figure 3) . The allelic discrimination method of the 
invention involves use of a first oligonucleotide probe 
which anneals with a target portion of the mammal's 
genome. The target portion comprises a portion of the 
upstream untranslated region of FGF-3 gene of the mammal, 
including the nucleotide residue at position 69 in SEQ ID 
NO: 1. Because the nucleotide residue at this position 
differs in the C-allele and the T-allele, the first probe 
is completely complementary to only one of the two 
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alleles. Alternatively, a second oligonucleotide probe 
can also be used which is completely complementary to the 
target portion of the other of the two alleles. The 
allelic discrimination method of the invention also 
involves use of at least one, and preferably a pair of 
amplification primers for amplifying a reference region 
of the FGF-3 gene of the mammal. The reference region 
includes at least a portion of the target portion, and 
preferably includes the nucleotide residue at position 69 
of the FGF-3 gene in SEQ ID NO : 1. 

Because the reference region and the target portion 
overlap by at least one base, preferably by at least 
about half the length of the target portion, and more 
preferably completely overlap, the enzyme (e.g. Thermus 
aquaticus {Taq} DNA polymerase) which catalyzes the 
amplification reaction and the first (or second) probe 
will collide. If the probe is not completely 
complementary to the target portion, it is more likely to 
dissociate from the target portion upon collision than if 
it is completely complementary. Therefore, unless the 
enzyme exhibits 5 1 ->3 ' exonuclease activity, 
amplification ceases or is greatly inhibited. 

If the enzyme which catalyzes the amplification 
reaction exhibits 5 ' ->3 ' exonuclease activity (e.g. Taq 
DNA polymerase) , then the enzyme will at least partially 
degrade the 5 ' -end of a probe with which it collides 
unless the probe dissociates from the target portion upon 
collision with the enzyme. As noted above, if the probe 
is not completely complementary to the target portion, it 
is much more likely to dissociate from the target portion 
upon collision than if it is completely complementary. 
If a detectable label is attached to a nucleotide residue 
at or near the 5 1 -end of the probe, release of the 
detectable label from the probe can be used as an 
indication that the enzyme and probe have collided and 
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that the probe did not dissociate from the target 
portion. Thus, release of the detectable label from the 
probe upon amplification of the region indicates that the 
probe was completely complementary to the target portion. 
By selecting either or both of a probe completely 
complementary to the target portion of the C-allele of 
the FGF-3 gene and a probe completely complementary to 
the target portion of the T-allele of the gene and 
assessing release of the label from the probe(s), the 
identity of the allele (s) can be ascertained. 

The probe is preferably a DNA oligonucleotide having 
a length in the range from about 2 0 to about 40 
nucleotide residues, preferably from about 2 0 to about 3 0 
nucleotide residues, and more preferably having a length 
of about 25 nucleotide residues. In one embodiment, the 
probe is rendered incapable of extension by a PCR- 
catalyzing enzyme such as Taq polymerase, for example by 
having a fluorescent probe attached at one or both ends 
thereof. Although non-labeled oligonucleotide probes can 
be used in the kits and methods of the invention, the 
probes are preferably detectably labeled. Exemplary 
labels include radionuclides, light-absorbing chemical 
moieties (e.g. dyes), fluorescent moieties, and the like. 
Preferably, the label is a fluorescent moiety, such as 6- 
carboxyf luorescein (FAM) , 6-carboxy-4 , 7 , 2 ' , 7 ' - 
tetrachlorof luoroscein (TET) , rhodamine, JOE (2,7- 
dimethoxy-4, 5-dichloro-6-carboxyf luorescein) , HEX 
(hexachloro-6-carboxyf luorescein) , or VIC. 

In a particularly preferred embodiment, the probe of 
the invention comprises both a fluorescent label and a 
fluorescence-quenching moiety such as 6-carboxy- 
N,N,N' , N 1 - tetramethylrhodamine (TAMRA) , or 4- (4'- 
dimethlyaminophenylazo) benzoic acid (DABCYL) . When the 
fluorescent label and the fluorescence-quenching moiety 
are attached to the same oligonucleotide and separated by 
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no more than about 40 nucleotide residues, and preferably 
by no more than about 3 0 nucleotide residues, the 
fluorescent intensity of the fluorescent label is 
diminished. When one or both of the fluorescent label 
5 and the fluorescence-quenching moiety are separated from 

the oligonucleotide, the intensity of the fluorescent 
label is no longer diminished. Preferably, the probe of 
the invention has a fluorescent label attached at or near 
(i.e. within about 10 nucleotide residues of) one end of 

10 the probe and a fluorescence-quenching moiety attached at 

or near the other end. Degradation of the probe by a 
PCR-catalyzing enzyme releases at least one of the 
fluorescent label and the fluorescence-quenching moiety 
from the probe, thereby discontinuing fluorescence 

15 quenching and increasing the detectable intensity of the 

fluorescent labels. Thus, cleavage of the probe (which, 
as discussed above, is correlated with complete 
complementarity of the probe with the target portion) can 
be detected as an increase in fluorescence of the assay 

20 mixture. 

If detectably different labels are used, more than 
one labeled probe can be used. For example, the assay 
mixture can contain a first probe which is completely 
complementary to the target portion of the C-allele of 

25 the FGF-3 gene and to which a first label is attached, 

and a second probe which is completely complementary to 
the target portion of the T-allele. When two probes are 
used, the probes are detectably different from each 
other, having, for example, detectably different size, 

30 absorbance, excitation, or emission spectra, radiative 

emission properties, or the like. For example, a first 
probe can be completely complementary to the target 
portion of the C-allele and have FAM and TAMRA attached 
at or near opposite ends thereof. The first probe can be 

3 5 used in the method of the invention together with a 
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second probe which is completely complementary to the 
target portion of the T-allele and has TET and TAMRA 
attached at or near opposite ends thereof. Fluorescent 
enhancement of FAM (i.e. effected by cessation of 
fluorescence quenching upon degradation of the first 
probe by Taq polymerase) can be detected at one 
wavelength (e.g. 518 nanometers ) , and fluorescent 
enhancement of TET (i.e. effected by cessation of 
fluorescence quenching upon degradation of the second 
probe by Taq polymerase) can be detected at a different 
wavelength (e.g. 582 nanometers). 

Ideally, the probe exhibits a melting temperature 
(Tm) within the range from about 60 to 70°C., and more 
preferably in the range from 65 to 67 °C. Furthermore, 
because each probe is completely complementary to only 
one of the C- and T-alleles of the FGF-3 gene, each probe 
will necessarily have at least one nucleotide residue 
which is not complementary to the corresponding residue 
of the other allele. This non-complementary nucleotide 
residue of the probe is preferably located near the 
midsection of the probe (i.e. within about the central 
third of the probe sequence) and is preferably 
approximately equidistant from the ends of the probe. 
Thus, for example, the probe which is completely 
complementary to the C-allele of the human FGF-3 gene 
can, for example, be completely complementary to 
nucleotide residues 58 to 77 of the C-allele, as defined 
by the positions of SEQ ID NO: 1. Because the C- and T- 
alleles differ at position 69, this probe will have a 
mismatched base pair nine nucleotide residues from one 
end when it is annealed with the corresponding target 
portion of the T-allele . 

By way of example, labeled probes having the 
sequences of SEQ ID NOs : 7 and 8 can be used, in 
conjunction with amplification primers having the 
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sequences of SEQ ID NOs : 5 and 6 in order to determine 
the allelic content of a mammal (i.e. to assess whether 
the mammal comprises one or both of an C allele and a T 
allele of FGF-3) . 

The size of the reference portion which is amplified 
according to the allelic discrimination method of the 
invention is preferably not more than about 100 
nucleotide residues. It is also preferred that the Tm 
for the amplified reference portion with the genomic DNA 
or fragment thereof be in the range from about 57 to 
61°C # where possible. 

It is understood that binding of the probe (s) and 
primers and that amplification of the reference portion 
of the FGF-3 gene according to the allelic discrimination 
method of the invention will be affected by, among other 
factors, the concentration of Mg ++ in the assay mixture, 
the annealing and extension temperatures, and the 
amplification cycle times. Optimization of these factors 
requires merely routine experimentation which are well 
known to skilled artisans. 

Another allelic discrimination method suitable for 
use in the present invention employs "molecular beacons". 
Detailed description of this methodology can be found in 
Kostrikis et al . , Science 1998;279:1228-1229, which is 
incorporated herein by reference. 

The use of microarrays comprising a multiplicity of 
reference sequences is becoming increasingly common in 
the art. Accordingly, another aspect of the invention 
comprises a microarray having at least one 
oligonucleotide probe, as described above, appended 
thereon. In a particular embodiment, the at least one 
oligonucleotide probe consists essentially of the 
nucleotide sequence of SEQ ID NO : 6 or SEQ ID NO: 7. 

It is understood, however, that any method of 
ascertaining an allele of a gene can be used to assess 
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the genotype of the FGF-3 gene in a mammal. Thus, the 
invention includes known methods (both those described 
herein and those not explicitly described herein) and 
allelic discrimination methods which may be hereafter 
5 developed . 

IV. Assessment of Predisposition to Cancer 

The present invention also provides a method of 
assessing the relative susceptibility of a mammal (e.g. a 

10 human) to a cancer such as a esophageal squamous cell 

carcinoma. The "relative" susceptibility of a mammal to 
the cancer refers to the fact that, among a population of 
individuals exposed to equivalent carcinogenic stimuli, 
some individuals are more likely to develop cancers than 

15 others. This differential carcinogenic potential is 

attributable, at least in part to the genetic makeup of 
the individuals in the population. Germ-line mutations 
in tumor suppressor genes (e.g. Rb, p53, BRCAl, WT1 , and 
the like) are one mechanism by which genetic differences 

20 contribute to predisposition to certain types of cancer. 

Mutations in other genes, such as oncogenes and genes 
encoding proteins involved in DNA repair, have also been 
implicated in carcinogenesis. 

In accordance with the present invention, it has 

25 been discovered that the presence of a C-allele of the 

FGF-3 gene is correlated with greater susceptibility to 
carcinogenesis in mammals, particularly in humans. The 
effect is furthermore dose-dependent, meaning that a 
first individual who is homozygous for the C-allele has a 

30 greater susceptibility than a second individual who is 

heterozygous for the C- and T-alleles, and that the 
second individual has a greater susceptibility than a 
third individual who is homozygous for the T-allele. 
Thus, the method of the invention for assessing the 

35 relative susceptibility of a mammal to a cancer comprises 
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determining whether the mammal comprises a C-allele, a T- 
allele or both in the 5' untranslated region of the FGF-3 
gene . 

V. Kit 

The present invention also provides a kit for 
performing the instant method disclosed herein. The kit 
comprises a plurality of reagents useful for performing 
the disclosed methods, and optionally further comprises 
an instructional material which describes how the method 
is performed. 

By way of example, an exemplary kit for performing 
the allelic discrimination method of the invention 
comprises : 

a) a first oligonucleotide probe which anneals 
specifically with target portion of the mammal's genome, 
wherein the target portion includes the nucleotide 
residue located at position 69 of SEQ ID NO: 1, the probe 
comprising a fluorescent label and a fluorescence 
quencher attached to separate nucleotide residues 
thereof, and 

b) a primer for amplifying a reference portion of 
the promoter region of FGF-3 gene, the reference portion 
including the nucleotide residue located at position 69 
as defined by the sequence of SEQ ID NO: 1. 

The kit may further comprise a DNA polymerase having 
5 ' ->3 ' exonuclease activity. The kit may also comprise a 
second oligonucleotide probe having a different annealing 
specificity than the first (e.g. wherein the first is 
completely complementary to the target portion of the C- 
allele and the second is completely complementary to the 
target portion of the T-allele) , a second primer (e.g. 
such that this and the other primer can be used to 
amplify at least the target portion by a PCR) , or both. 
The kit may comprise an instructional material which can, 
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for example, describe performance of the allelic 
discrimination method, the association between the 
presence of the C-allele and carcinogenic susceptibility, 
or both. 

5 In an alternative embodiment of the present 

invention, the kit comprises at least one, and preferably 
two molecular beacon probes, as described herein. When 
the kit comprises two molecular beacon probes, one is 
preferably specific for (i.e. completely complementary to 

10 a region including nucleotide residue 69 of SEQ ID NO: 1) 

the C-allele of the FGF-3 gene, and the other is specific 
for the T-allele. This kit may further comprise an 
instructional material, as described above. 

Also provided by the present invention are kits for 

15 assessing the susceptibility of a mammal to a cancer 

according to the one or more of the methods of the 
invention. The kit comprises a plurality of reagents 
useful for performing one of the methods as described 
above, and optionally further comprises an instructional 

2 0 material which describes how the method is performed and 

the association between the presence of the C-allele and 
carcinogenic susceptibility . 

Although the foregoing disclosure is principally 
directed to kits and methods which are applicable to 

25 human cancers, it will be understood by the skilled 

artisan that such methods and kits are generally 
applicable to cancers of mammals of all sorts. 
Modification, where necessary, of the kits and methods of 
the invention to conform to non-human cancers is well 

30 understood, and the ordinarily skilled veterinary worker 

can design and perform such modification with merely 
ordinary, if any, experimentation. Representative 
mammals include, for example, primates, cattle, pigs, 
horses, sheep, cats, and dogs. 
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The kits and methods described herein are applicable 
for substantially any cancer, including, for example, 
esophageal carcinoma. Other cancers for which the 
compositions, kits, and methods described herein can be 
used include breast, ovarian, prostate, and head and neck 
cancer . 



The following examples are provided to illustrate 
certain embodiments of the invention. They are not 
intended to limit the invention in any way. 

EXAMPLE I 

Identification of a Single Nucleotide 
Polymorphism (SNP) in the FGF-3 Gene 

In this study, genomic DNA was obtained from 81 
human subjects (39 Caucasians, 42 Chinese) , and used for 
amplifying a 772 bp fragment which includes the 5'- 
proximal region of the FGF3 promoter. The primers used 
for this PCR procedure are as follows: 

Forward primer: 5' - gca, gcc, ctg, cct, cag, 

aaa, ac - 3 ' ( SEQ ID NO : 2) 

Reverse primer: 5' - tgc, acc, cca, ctt, eta, 

gca, tea, g -3' (SEQ ID NO: 3) 

By direct sequencing of the 772 bp fragment, a 
single nucleotide polymorphism (SNP) , C/T, was discovered 
at position 6693 bp upstream of the ATG start codon of 
FGF3 coding sequence, i.e., position 69 of SEQ ID NO: 1. 
(Fig. 1) 

EXAMPLE II 
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Trans feet ion Assays 

To study the FGF-3 promoter activity for the C- and 
T-alleles of FGF-3 in vivo, two reporter constructs were 
generated. In these constructs, a fragment corresponding 
to nucleic acids 6761 to 6198 upstream of the ATG codon 
of human FGF-3 gene with either a C or T at position 6693 
(SEQ ID NO: 1, Figure 1) was cloned into pGL3 plasmid 
(Promega Corp., Madison, WI) and operably linked with a 
modifier firefly lucif erase gene. Using LIPOFECTIN 
Reagent (Life Technologies, Cat# : 18292-011), these C- or 
T-allele reporter constructs were transiently transfected 
into NIH3T3 cells. Lucif erase activities were then 
assayed in cell extracts harvested 24 hours post- 
transf ection. Cells transfected with empty pGL3 plasmid 
were used as controls. The results, listed in Table 2, 
reveal that the presence of the T-allele of the FGF-3 
gene is associated with elevated promoter activity when 
compared with the C-allele. 
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Table 1. Functional analysis of the two human FGF-3 

alleles 



10 





Lucif erase 


activitv a 


Experiment 


C allele 


T allele 


1 


31.03 


92.49 


2 


14.90 


23 . 64 


3 


46 .38 


73 .20 


4 


18.93 


26.73 


Meant SD 


27. 8± 14.15 


54.01± 34.23 



a Ratio of relative firefly luminescence units/ug of 
protein to relative renilla luminescence units/ug of 
protein measured in the same cell extract. 

15 

EXAMPLE III 
Cancer Susceptibility Assessment 

As described above, allelic discrimination methods, 
more specifically, TaqMan-based genotyping was used to 
20 determine the genotype of FGF-3 gene in human subjects. 

The following are the primers and probes used in these 
studies : 

Primers : 

25 Fgf3/snp #F: 5'- get, tea, ccc, cag, aga, tga, ggg 

-3' (SEQ ID NO: 4) 
Fgf3/snp #R: 5'- age, tgt, atg, cag, ccc, ctg, tg 

-3 ' (SEQ ID NO: 5) 

30 Probes : 

Probe #1: Vic- etc, cct, cac, etc, cag, cca, cat, g- 

TAMRA (SEQ ID NO : 6) 
Probe #2: 6FAM- etc, cct, cgc, etc, cag, cca, cat, g- 

TAMRA (SEQ ID NO : 7) 

35 

By a TaqMan-based genotyping assay, the FGF-3 
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genotype distribution in DNA samples from 64 African 
American, 7 8 Caucasian, and 171 Chinese was determined. 
The results, shown in Table 2 and Figure 3, clearly 
suggest that ethnic differences exist in FGF-3 allele 
5 frequencies . 

Table 2 . Distribution of Human FGF-3 Gene in Ethnic 

Groups 



3roup 


CC 


CT 


TT 


Total 


Africa American 


42 


18 


4 


64 


Caucasian 


57 


19 


2 


78 


Chinese 


64 


58 


49 


171 












3 roup 


CC 


CT 


TT 




Africa American 


66% 


28% 


6% 




Caucasian 


73% 


24% 


3% 




Chinese 


37% 


34% 


29% 





2 0 A case-controlled study of esophageal cancer in 

China was also conducted by determining the distribution 
of C-allele FGF-3 gene among human subjects. In this 
study, the controls (n=171) were well matched on cases 
(n=168) for sex, age, and smoking status (Table 3). The 

25 results reveal an adjusted odds ratio (OR) of esophageal 

cancer of 2.38 for the CC genotype and 1.98 for the CT 
genotype . 



25 
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Table 3. Characteristics of cases with esophageal 
squamous cell carcinoma (ESCC) and controls 





Variable 


Cases (n = 168) 


Controls 


(n =171) 


P value 


5 


Sex {%) 










0.498 




Male 


144 


( 85) 


142 


(83) 






Female 


24 


(15) 


29 


(17) 






Mean age (yr) (SD) 


58.1 


(9.7) 


58.9 


(5.0) 


0.480 




Smoking status (%) 










0.645 


10 


No 


61 


(36) 


58 


(34) 






Yes 


107 


(46) 


113 


(66) 






Table 4. FGF-3 


genotype 


in cases 


and controls 


and their 



association with risk of ESCC 



Genotype Cases (n= 168) Controls (n = 171) OR a ' b ' c (95% CI d ) 





No. 


% 


NO. 


% 






TT 


26 


15 . 5 


49 


28.7 


1.00 




TC 


61 


36.3 


58 


33.9 


1.98 (1.15-3 


. 83) 


CC 


81 


48.2 


64 


37.4 


2.38 (1.32-4 


.29) 


C allele frequency 


0. 


664 




0.544 






a ORs and 95% CIs 


were 


calculated 


by 


logistic 


regression, 





with the TT genotype as the reference group and adjusted for 
age, sex, and smoking status. 
25 b Trend test, P = 0.005. 

c OR: odds ratio. 
d CI: confidence intervals. 

The above results demonstrate that the presence of 
30 the C-allele in the FGF-3 gene 5' untranslated region is 

correlated with greater susceptibility to esophageal 
cancer. Additionally, the effect is dose-dependent, 
i.e., a subject who is homozygous for the C-allele FGF-3 
gene has a higher risk of developing esophageal cancer 
35 relative to subjects who are heterozygous (C-T) . 

Finally, heterozygous individuals are at a greater risk 
than subjects who are homozygous for the T-allele at this 
position in the FGF-3 gene. 
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While certain of the preferred embodiments of the 
present invention have been described and specifically 
exemplified above, it is not intended that the invention 
be limited to such embodiments. Various modifications 
may be made thereto without departing from the scope and 
spirit of the present invention, as set forth in the 
following claims . 
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